Show, Recall, and Tell: Image Captioning with Recall Mechanism
Authors
Abstract
Similar resources
Show, Discriminate, and Tell: A Discriminatory Image Captioning Model with Deep Neural Networks
Caption generation has long been seen as a difficult problem in Computer Vision and Natural Language Processing. In this paper, we present an image captioning model based on an end-to-end neural framework that combines a Convolutional Neural Network and a Recurrent Neural Network. Critical to our approach is a ranking objective that attempts to add discriminatory power to the model. Experiments on M...
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
The aim of image captioning is to have machines generate captions that describe image contents as humans do. Despite many efforts, generating discriminative captions for images remains non-trivial. Most traditional approaches imitate language structure patterns and thus tend to fall into a stereotype of replicating frequent phrases or sentences while neglecting the unique aspects of each image. In th...
Region-Based Image Interpretation and Recall
In this paper, we present an approach whose primary aim is semantic interpretation and region retrieval for an attentive region in a color image. The main components of the system include image feature extraction, an indexing process, the construction of linguistic inference rules, and semantic description. Based on these features, each of the attentive regions in an image can be described ...
Recall termination in free recall.
Although much is known about the dynamics of memory search in the free recall task, relatively little is known about the factors related to recall termination. Reanalyzing individual trial data from 14 prior studies (1,079 participants in 28,015 trials) and defining termination as occurring when a final response is followed by a long nonresponse interval, we observed that termination probabilit...
Show-and-Fool: Crafting Adversarial Examples for Neural Image Captioning
Modern neural image captioning systems typically adopt the encoder-decoder framework consisting of two principal components: a convolutional neural network (CNN) for image feature extraction and a recurrent neural network (RNN) for caption generation. Inspired by the robustness analysis of CNN-based image classifiers to adversarial perturbations, we propose Show-and-Fool, a novel algorithm for ...
Journal
Journal title: Proceedings of the AAAI Conference on Artificial Intelligence
Year: 2020
ISSN: 2374-3468,2159-5399
DOI: 10.1609/aaai.v34i07.6898